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ABSTBACT 

This paper describes seven experiaents related to 
huaan coaannication research. The first two ezperiaents discuss 
studies treating the aural responses of listeners. The third 
experiaent was undertaken to estiaate the inforaation of sounds and 
diagraas which aight lead to an estiaate of the redundancy ascribed 
to the phonetic structure of words. A total of 1549 words of one 
syllable and 2151 of two syllables were transcribed phonetically 
using ai phonetic syabols* Proa this experiaent it was concluded that 
the intelligibility of a word is aost closely associated with the 
intelligibility of the initial^ounds of a word. The fourth 
experiaent was undertaken with^contextual aaterial and short phrases 
froa two sources: flight patter and newspapers. The fifth experiaent 
was related to letter prediction. The sixth experiaent used single 
words in different contexts. The seventh experiaent was concerned 
with the nature of the distribution of responses. (TS) 
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INFORMATION THEORY: A METHOD FOR HUMAN COMMUNICATION RESEARCH 

John W. Black, The Ohio State TJhi varsity 
SCA Chicago 197^ 

1. George Miller and Patricia Nicely reported a study In 1955* that 

has become a standard amonp; works on speech perception. Sixteen consonants 
were spoken initially before the vowel /a/ in a list of 200 nonsense syllables. 
This list was recorded by each of 5 female speakers. Four listeners wrote the 
syllables they belie. 'ed they heard, providing 800 syllable-response events per 
talker for which confusions could be studied. Pooling the five talkers, gave 
U,000 observations at each condition that was tested. The conditions were 6 
signal-to-noise ratios, -l8, -12, -6, 0, +6, +12 dB. Furthermore, the band- 
widths were varied to provide low-pass cutoffs at 300, UOO, 600. 1200, 2500, 
and 5000 Ha and high-pass cutoffs at. 200, 1000 , 2000, 2500, 30C-, and U5OO Hz. 
The responses were published in 18 confusion matrices* Supposedly this work 
dates the origin of the terms confusion matrix , error matrix , response matrix , 
and the like. The scope of the work and the reporting of the results are only 
two of several reasons why the study is considered a classic within the area 
of concern. Another reason is that the consonants were studied in terms of 
a system of distinctive features, specifically a system that included -oicing, 
nasality, affrication, duration, and place of articulation. When I say 
"studied" I refer more specifically to the fact that each distinctive feature 
of a consonant was considered as a separate channel, and the ratio betvreen Jhe 
input and output of each channel was determined for each of the I8 experimental 
conditions. The most satisfactory experimental condition was bandwidth 
200 - 6500 Hi and signal-to-noise ratio 12 dB. Under this condition voicing was 
conveyed properly 97? of the time, nasality 100^, afftrication duration 93^> 
and place of articulation 71^. The fact that these channels summated vO more 
than the theoretical whole v;as attributed to interaction or "crosstalk". The 
presence of one feature made the presence of another feature more likely. This 
crosstalk is what we are experiencing today as we inadvertently slip into the 
area assigned to one of the other speakers. In the parlance cf the day we are 
calling this redundancy. 

H. The Miller and Nicely study illustrates a procedure that is mandatory 

as far as I know in working with information theory: a process of absolute 
^Judgment or of identifying a stimulus. Ify high regard for the work of Miller 
and Nicely is shotm by the fact that Sadanand Singh and I used it as a model 
in a study of speech production and speech perception by four language grouns. 
Twenty-six intervocalic consonants were recorded by three speakers of each of 
four languages, Hindi, English, Arabic, and Japanese and heard by 2k sneakers 
of each of them. The data were treated in two ways, 1) an analysis of variance 
of correct responses (all speakers spoke better and all listeners listened 
better when saying and hearing sounds of their native language)) and 2) the 
quantitative procedure employed singularly by Miller and Nicely — our tot^ic 
today. This was adapted to ascertain which features were retained by the 
listeners in their error responses and whether or not these were similar from 
one language grouo to another. We used se 'en channels instead of five and, 
of course, had four language grouos instead of one* The striking outcome lay 
in the rank order of the seven channels in the relative amounts of information 
per channel, or the relJiti'-e imnortance of the channels. A single rank order 
in this regard obtained for all of the listening grou-'s: 1) nasality, 
2) place, 3) liquidness, U) voicing, 5) duration, 6) friction, and 7) aspiration. 
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This outcome varied in one important detail from the related result of Miller 
and Nicely. With them, place of articulation was the hardest to hear correctly. 
Tn our instance, it tias the second easiest. 

HI. 'Stie two QiQBW^ding studies treated the aural responses of listeners. 
I cc»ie now to an approach to our code of communication. 'Rtis piece of work 
I'm delighted to have behind me. First, I am pleased with it, and second, I 
would never want to tackle it again. The study was a finger exercise in 
applying elementary techniques of information theory to phonetic probability. 
One- and two-syllable isolated words were sampled with respect to (a) the 
relative frequency of speech sounds, p (jL) (*the probability of sound i^')> 
(b) the relative frequency of occurrence~of the sounds at different pos^itions 
in the v/ord, (c) the probability of two sounds occurring in succession, 
p (i^, ^) {*the 3oint probability of*a sequence of sounds designated i and ^*), 
(d; the probability that one sound follows another, pi (iL) (('the conditional 
probability that sound ^ follows sound i^*), and (e) tHe probability that a 
sound will precede another, (i) ('the conditional probability that sound ± 
precedes sound j'). Words of one- and two-syllables and of differing numbers 
of sounds were treated separately. Tkie princinal objective was to estimate 
the information of sounds and digrams. This would lead to an estimate of the 
redundancy that light be ascribed to the phonetic structure of words. A 
secondary objective was enumerative, to find the probability of the sounds 
and digrams represented by the sample. 

The narrowness of the problem is emphasized. The sample was not 
continuous language and it might be regarded as zero-order approximation to the 
language of speech, i.e., an array of words not weighted by their probability. 
There were 1^U9 words of one syllable and 21^1 of two syllables. They had 
been selected from words of Thorndike ratings 1-10 though excluding homonyms, 
homographs, and words of greatest and least intelligibility. The words were 
transcribed phonetically using hi phonetic symbols. Thus there were 5.35 
bits per phoneme. Let me summarize a few results. 1) The average information 
per sound in bits was: 

One-syllable words 

i-sounds ^-sounds 
3-sound " 5.0^1 U.65 

Wound ^.15 ^•^O 



Two-syllable words 
i-sounds J-sounds 
l»-sound U.68 U.31 

5- sound kM U.I5 

6- sound U.UO U«33 

2) Computation of the information of the digrams yielded the following 
information per symbol: 

One-syllablevwords Two-syllable words 

3-sound k,21 .... 

U-sound 3.35 3.89 

5- 80und .... 3.89 

6- 80und •••» 3.75 
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5,35 


U,85 


k»2l 


5.35 




3.35 


5,35 




3.89 


5*35 


U.32 


3.89 


5.35 


U,37 


3.75 



-3- 

3) The decrements in information from a) maximum (5.35 bits) to b) observed 
average bits per symbol to c) observed bits per digram symbol follow; 

3-8ound> 1 syllable 

U-sound, 1 syllable 

U-sound, 2 syllable 

5- 80und, 2 syllable 

6- sound, 2 syllable 

Bie fact that the relative uncertainty of successive sounds decreases 
in the course of a word provides an interesting parallel — perhaps a 
coincidence — > to some results X obtained in a different line of research. 
Briefly, the intelligibility of a word was found to be most closely associated 
with the intelligibility of the initial sounds of a word. The parallel is 
striking in that in the present instance the greatest uncertainty In the 
phonetic complex of a word is at the outset. 

IV, A somewhat complicated approach to the code of communication was 

undertaken with contectual material, short phrases from two sources: flight 
patter and newspapers. Approximately kOO new students in a naval flight 
training program served as experimental subjects, each responding to U, 5, or 
6 phrases. One group was told the source of each phrase, newspaper or flight 
patter. The second group heard the phrases described as "newspaper language". 
In keeping with Shannon's original procedure, a subject was encouraged to 
continue guessing successive letters in a phrase through ten trials if 
necessary and then was told the letter if he had not guessed it. He kept a 
record of his guesses and of the correct letters and was told when the phrase 
ended. The corpus had the following dimensions: 

Flight phrases Newspaper phrases 

Total phrases and 

total syllables 6U; 320 6k; 320 

Length in typewriter- 
spaces and median 

length in spaces • 13-25; 19 lJ>-23; 19 

Total words and number 

of different words 260; lUO 2k0; IkQ 

Total letters 1252 1201 

On the average uncertainty decreased throughout a phrase fran 
approximately k bits for the first space to approximately 1.2 bits for the 
21st space. The correct predictions incf eased from approximately 25^^ in the 
first word to 6% in the fifth word. These results are in keeping with ones 
obtained by Shannon and with our intuitive notions of the power of context. 
Let me note that these results were obtained from the averages of several 
phrases surfer iranosed, one unon another. I have recently been somewhat dis- 
illusioned in this regard through the results of 10 predictors working with 
the same naragre?>hs. I have not extended the results to measures of infor- 
mation. That will come later, perhans much later. However, t^he first nredictions 
of the letters of words throughout the raragranh — and this was carried 
through to 10 F^aragraphs from varied sources, each paragranh of approximately 
UOO spaces — showed little effect of context. Using a signal-detection model, 
I found no effect of context beyond 15 spaces. There is still the possibility 
that measures of information vill show the rising function that I have described 
o with ncvrspaper and flight pnttcr phrases and with which we are familiar. 
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V, One more study that is related to letter prediction. This was one 

component in a comparative study of the styles of Rufus Choate, his contemner* 
ariest and our con temporaries — all orators* Again the corpus was phrases » 
50 from each source. The phrases included at least 30 typewriter spaces, were 
randomly selected, and were not necessarily at the outset of a sentence* Each 
phrase was predicted, letter-by-letter by 10 uni%'er8ity students, m contrast 
to the preceding procedure a predictor was given the correct response after 
each prediction* The uncertainty conveyed by the predicted first letter of 
words 1, 2, 3> and k of the materials provide an interesting contrast. 

Choate's 20th Century 

Choate Contemporaries Speakers 

(Ist word) (2.67U) (2.6?^) (2.6?^^) 

2nd word 2.526 2.168 2.269 

3rd word 2.371 I.878 2,30U 

kth word 2.516 2.314 2.158 

Mean 2.^71 2,120 2.2UU 

My first observation would be that knowing one word of a Choate phrase provided 
fewer clues to the next word than knowing the first word of phrases of other 
speakers. The same was true when knowing the second word; and when knowing the 
thttd word. I sometimes view this process as the ability of a listener to 
track a speaker, m this parlance Choate was more surprising from word to 
word than either his contemooraries or our contemporaries. This surprise was 
measured by 20th Century university students* 

VI» Many people are impatient with letter predictions. Even a word-to- 

word approach may be too atomistic to satisfy them. I am working in the next 
instance with 50 separate words and putting them in 5 contexts. 

Context 1: The word in isolation. 
Context 2: The word repeated in isolation. 

Context 3: The word in conjunction with a warning that the word is not a 

closely identified word aurally. 
Context U: The word in 2nd order word approximation. 
Context 5s The word in 3rd order word approximation. 

(These last two were determined in the manner of Miller and Self ridge.) The 
criterion score was word intelligibility as determined by ?5 listeners who 
heard the materials in a signal-to-noise ratio of 2 dB. Maximum uncertainty 
would be represented by 25 different word responses by the 25 listeners; 
minimum uncertainty by a single response on the part of all listeners. The 
range coHild be from U.65 bits to 0 bits in a response set to one stimulus. 
The obtained results were: context one, isolated word 2.68 bits; context tv^o, 
repeated word, 2.32 bits; context thr^e, word followed by a warning not 



(the word that had been written most Tiequently in error), 2.19 bits; context 
four, second order word approximation, I.78 bits; context five, third order 
word approximation, l.UO bits. 

VII# I come now to a different kind of application of information theory, 
the information conveyed by a closed message set as illustrated by a multiole- 
choice test. I have a series of multiole-choice intelligibility tests. 
Pooled, these are comprised of 112U items. Each listener's response set 
incluies h v;ords per item, the test word and the three words that were vritten 
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most frequently in error when the test wor-is were used In write-down intelli- 
gibility testing?. Thus the available foils for prew are drew , crew , and groove, 
niese tests were recoried by k speakers and heard by 32 listening panels of 
12-15 members. Sixteen of the naiiels heard the list in quiet lines with the 
signals at 60, ^0, UO, and ^0 dB listening- level i and 16 heard the lists at 
12, 8, h, and 0 dB signal-to-noise ratio. With dlecreased level the ratio of 
the uncertainty of the response to maximum uncertainty increased from 17 to 2k 
to 38 to 89^}; with deteriorating signal-to-noise ratio the percentages increased 
from 22 to 30 to 3k to 77^» Here, one is concerned with the nature of the 
distribution of the responses. As the four available responses become more ani 
more equally probable, the uncertainty conveyed by them increases to approxi- 
mately 2 bits* I anticipated on the basis of earlier work that the principle 
of a constant ratio among the three error responses would be demonstrated with 
changing listening circumstances* This was not the case* However, the 
prospect invites further investigation, 

VIII* I have illustrated only a few applications of the use of 'information 
theory in human communication* These illustrations have included uses of 
Shannon's first method, that is allowing a predictor to continue predicting 
until all possibilities are exhausted, and his 2nd method, that is giving the 
experimental subject the correct response after each prediction* I have 
illustrated the use of information theory with word responses, letter responses, 
and phoneme responses* The focus of interest has varied from context to 
distributions of available resronses and has included inferences about an 
ill-defined coride'it, style* The source of the materials investigated has been 
varied and the effect of knowing the source as opposed to not knowing it has 
been measured* By inference it seems clear that a prerequisite for the use 
of information theory is a closed message set* Whether this exhaustive 
enumeration is an alphabet cr the lines of reasoning of Aristotle is 
inconsequential* 

The limited examples have made no reference to one important aspect 
of information transmission, namely the rate of transmission, I hhall not 
expand on this omission* It has net come within my ken* 
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